Application No. 10/588,570 

Response to Final Office Action dated September 14, 2010 



REMARKS 

Reconsideration of the present application in view of the above amendments and 
the following remarks is respectfully requested. 

Claims 1, 2, 6-9 and 14-34 are pending, of which claims 1, 2, 14-27, 33 and 34 are 
under consideration. 

Claim 1 has been amended to clarify the claimed subject matter without 
acquiescing to the rejections in the Office Action. Support for the amendments to claim 1 may 
be found, for example, at page 10, first paragraph, of the present application. No new matter has 
been added via the amendments to the claim. 

The specification has been amended to add a sequence identification number for 
the sequence of the A motif of family A DNA polymerases or their Klenow fragments, 
DYSQIELR, A new sequence listing is provided herewith to add the missing sequence 
identification number for the above-noted sequence. Additionally, the application has been 
amended to insert the paragraph that incorporates the text copy of the Sequence Listing, as filed 
electronically via EFS-Web. Applicants respectfully submit that no new matter has been added 
via the amendments to the specification or to the sequence listing submitted herein. Applicants 
further submit that the above-identified application is now in compliance with 37 C.F.R. §§ 
1.821-1.825 and WIPO Standard 25. 

Election/Restrictions 

Applicants respectfully request that because there is no prior art-based rejection 
against the elected species, the search and examination be extended to species other than the 
elected species, i.e., a polymerase from Thermus aqmticus. In addition, as discussed below. 
Applicants believe that claim 1 and its dependent claims under consideration are allowable. 
Accordingly, Applicants further respectfully request that the withdrawn claims, which all 
ultimately refer to claim 1 and thus incorporate the features recited in claim 1, be examined in the 
present application. 
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Objection to Specification 

The specification stands objected to because the amino acid sequence 
"DYSQIELR" recited in paragraph 6 on page 14 does not have a sequence identifier. 

Applicants thank the Examiner for noting such an informality. Applicants have 
amended the specification to eliminate this informality and submitted a corrected sequence 
listing to include the sequence identifier for the above-noted sequence. Accordingly, Applicants 
submit that this ground of objection has been overcome. Withdrawal of this objection is 
respectfully requested. 

Rejection Under 35 U.S.C. 1 12 (Written Description) 

Claims 1, 2, 14-27, 33 and 34 stand rejected under 35 U.S.C. 112, first paragraph, 
as failing to meet the written description requirement. More specifically, the Office Action refers 
to the rejection in the previous Office Action, which is reproduced below: 

The specification, however, only provides the representative species of Taq and E. 
coli DNA polymerase mutants encompassed by these claims. There is no 
disclosure of any particular structure to function/activity relationship in the single 
disclosed species. The specification also fails to describe additional 
representative species of these family A DNA polymerases by any identifying 
structural characteristics or properties other than the activities in claims 1, for 
which no predictability of structure is apparent .... While applicants have 
functionally defined the claims in terms of the result of a motif C modification of 
a family A DNA polymerase, the claims are not limited structurally in any way, 
such that since the claims are drawn to a family A DNA polymerase which has 
been modified, the claims no longer have any structural limitations. Given this 
lack of species representative of such an unlimited genus of modified DNA 
polymerases, as encompassed by the claims. Applicants have failed to sufficiently 
describe the claimed invention, in such fiiU, clear concise, and exact terms that a 
skilled artisan would recognize Applicants were in possession of the claimed 
invention. 

Applicants respectfully traverse this ground of rejection. The written description 
requirement is met if the applicant conveys to those skilled in the art that, as of the filing date 
sought, he or she was in possession of the invention. Vas-Cath, Inc. v. Mahurkar, 935 F.2d 
1555, 1563-64; 19 USPQ 1111 (Fed. Cir. 1991). "A claim will not be invalidated on section 1 12 
grounds simply because the embodiments of the specification do not contain examples explicitly 
covering the full scope of the claim language. This is because the patent specification is written 
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for a person of skill in the art, and such a person comes to the patent with the knowledge of what 
has come before. Placed in that context, it is urmecessary to spell out every detail of the 
invention in the specification; only enough must be included to convince a person of skill in the 
art that the inventor possessed the invention and to enable such a person to make and use the 
invention without undue experimentation." Lizard-Tech, Inc. v. Earth Resource Mapping, PTY, 
Inc., 424 F.3d 1336, 1345; 76 USPQ2d 1724 (Fed. Cir. 2005). 

In the response to the previous Office Action, Applicants argue that claim 1 of the 
present application is directed to a family A DNA polymerase or its Klenow fragment that has a 
modified motif C sequence and an enhanced mismatch discrimination as compared to the 
corresponding wild type polymerase or its Klenow fi-agment. In the modified motif C sequence, 
at least the amino acid residue Q879 in the wild type motif C sequence QVH in positions 879- 
881 of the E. coli DNA polymerase BClenow fi*agment shown in SEQ ID N0:2 has been replaced 
by a lipophilic amino acid residue. The term "family A DNA polymerase" is defined in the 
present application as referring to DNA polymerizing enzymes that contain the A motif with the 
sequence DYSQIELR in their active sites. Thus, the family A DNA polymerase or its Klenow 
firagment according to claim 1 of the present application has the following two structural 
features: (1) it has the A motif with the sequence DYSQIELR in its active site, and (2) it has a 
modified motif C sequence in which at least the amino acid residue Q879 in the wild type motif 
C sequence QVH in positions 879-881 of the E. coli DNA polymerase Klenow fragment has 
been replaced by a lipophilic amino acid residue. In addition, the family A DNA polymerase or 
its Klenow fragment claimed in the present application has the following two functional features: 
(1) it has DNA polymerase activity, and (2) it has an enhanced mismatch discrimination as 
compared to the corresponding wild type polymerase or its Klenow fragment. 

In the previous response. Applicants further argued that one of ordinary skill in 
the art would not doubt that the present inventors had possession of the claimed invention at the 
filing of the present application. First, such a person would not doubt that the inventors had 
possession of a family A DNA polymerase . Applicants disagree with the assertion in the Office 
Action that the claims are not limited structurally in any way. As indicated above, because the 
present application defines the term "family A DNA polymerase" as referring to a DNA 
polymerase that contains the A motif with the sequence DYSQIELR in its active site, the claimed 
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family A DNA polymerase or its Klenow fragment has motif A with the sequence DYSQIELR in 
its active site. Applicants submit that family A DNA polymerases and its Klenow fragments 
were well known and characterized in the art at the filing of the present application. More 
specifically, as indicated in Patel et al. (J. Mol. Biol. 308:823-837, 2001, "Patel," copy enclosed), 
over 50 family A polymerases from different prokaryotic species had been sequenced {see. 
Abstract). Sequence alignments of these and other related polymerases show six conserved 
regions that form the active sites with regions 3, 4 and 5 corresponding to most conserved motifs 
A, B and C, respectively {see, pages 825 and 826, Figures 2 and 3). In addition, high-resolution 
crystal structures of several DNA polymerases had also been determined {see, page 824 and 
Figures 1 and 4-7). These studies in combination with mutagenesis analyses illustrated the 
structure-fimction relationship of family A DNA polymerases, including the fiinction of 
individual amino acid residues in polymerization {see, pages 832-834). Thus, although not 
explicitly described in the present application, numerous family A DNA polymerases and their 
structure-function relationship were known in the art. In view of the advance understanding of 
family A DNA polymerases in the art at the time the present application was filed, one of 
ordinary skill in the art would not doubt that the present inventors had possession of a family A 
DNA polymerase other than Taq and E. coli DNA polymerases explicitly described in the present 
application. 

Second, one of ordinary skill in the art would not doubt that the present inventors 
had possession of a family A DNA polymerase with an enhanced mismatch discrimination as 
compared to the corresponding wild type polymerase . As discussed above, the claimed DNA 
polymerase or its Klenow fragment has an additional structural feature: It has a modified motif C 
sequence in which at least the amino acid residue Q879 in the wild type motif C sequence QVH 
in positions 879-881 of the E. coli DNA polymerase Klenow fragment has been replaced by a 
lipophilic amino acid residue. The present application provides numerous exemplary 
substitutions of the QVH sequence {see, the third frill paragraph on page 11). It ftirther shows 
that certain exemplary substitutions of the QVH sequence resulted in enhanced mismatch 
discrimination activity of the modified DNA polymerases {see, Examples 3, 5 and 6, Figures 1- 
4). In view of such disclosure provided by the present application, Applicants submit that one of 
ordinary skill in the art would not doubt the possession of a family A DNA polymerase with an 
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enhanced mismatch discrimination as compared to the corresponding wild type polymerase by 
the present inventors. 

In response to Applicants' previous arguments, it is asserted in the Office Action: 

It continues that applicants [sic] claims are drawn to any family A DNA 
polymerase or its Klenow fragment having a modified motif C sequence and an 
enhanced mismatch discrimination as compared to the corresponding wild type 
polymerase to is Klenow fragment, wherein in the modified motif C sequence at 
least the amino acid residue Q879 has been replaced with a lipophlic amino acid 
residue. Thus applicant's [sic] claims are clearly drawn to "mutant" family A 
DNA polymerases. Such is supported by applicant's specification at page 10, line 
4, in which applicants state ". . . they also include thermostable DNA 
polymerases and their mutants." 

Thus while applicants submit that the term "family A DNA polymerase" is 
defined in the present application as referring to DNA polymerizing enzymes that 
contain the A motif with the sequence DYSQIELR in their active site, applicants 
claims are drawn to "mutant family A DNA polymerases" for which the structural 
limitations associated with applicants referred to "family A DNA polymerases" 
are not necessarily associated with the claimed mutant family A DNA 
polymerases". Thus it remains that applicants [sic] only structural limitations 
refer to the mutated amino acid position and not to the remainder of the "mutated 
family A DNA polymerase". 

Thus the functional features of having DNA polymerase activity, and especially an 
enlianced mismatch discrimination activity are not associated with any structural 
features including the motif A with the sequence DYSQIELR in its active site. 
While the art may describe family A DNA polymerases (Patel et al. (J. Mol. Bio. 
308:823-837, 2001, "Patel," copy enclosed), applicants claims are not so limited. 

Applicants disagree with the interpretation of the claims in the Office Action. 

Applicants submit that one of ordinary skill in the art, in view of the present application 

(especially the explicit definitions of "family A DNA polymerases" and "DNA polymerase 

according to the invention" provided in the application), would interpret that the claimed family 

A DNA polymerase or its Klenow fragment has the A motif with the sequence DYSQIELR and 

DNA polymerase activity. More specifically, the present application provides: 

"Family A DNA polymerases" (also referred to as "polymerases I") are those 
DNA-polymerizing enzymes which contain the A motif with the sequence 
DYSQIELR in their active site. They also include the enzymes described herein 
which have mutations in the C motif. In particular, they also include thermostable 
DNA polymerases and their mutants, {see, first full paragraph on page 10) 
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The present application forther provides: 

A "DNA polymerase according to the invention" is a family A DNA polymerase 
as defined above which includes the A motif with the sequence DYSQIELR in its 
active site and comprises particular mutations in the C motif. In particular, they 
also include thermostable DNA polymerases with mutations in the C motif 
These mutations are conservative substitutions of the QVH amino acid residues of 
the C motif and/or the above defined non-conservative substitutions, (see, sixth 
paragraph on page 14) 

The interpretation in the Office Action that the Family A DNA polymerase 
claimed in the present application may contain mutations in the A motif directly contradicts the 
above explicit definitions, and is thus incorrect. Although Family A DNA polymerases may 
include thermostable DNA polymerase mutants, one of ordinary skill in the art would know, 
based on the definition provided in the present application, that such mutants do not have 
mutations in the A motif, but in the C motif. 

Nevertheless, to facilitate allowance and to avoid misinterpretation of the claims. 
Applicants have amended claim 1 to specifically recite that tiie family A DNA polymerase or its 
Klenow fragment comprises the A motif with tiie sequence DYSQIELR. Accordingly, as 
indicated in Applicants' previous response, the family A DNA polymerase or its Klenow 
fi:agment according to claim 1 of the present application has the following two structural 
features: (1) it has the A motif with tiie sequence DYSQIELR in its active site, and (2) it has a 
modified motif C sequence in which at least the amino acid residue Q879 in the wild type motif 
C sequence QVH in positions 879-881 of tiie E. coli DNA polymerase Klenow fragment has 
been replaced by a lipophilic amino acid residue. In addition, the family A DNA polymerase or 
its Klenow fragment claimed in the present application has the following two functional features: 
(1) it has DNA polymerase activity, and (2) it has an enhanced mismatch discrimination as 
compared to tiie corresponding wild type polymerase or its Klenow fragment. Moreover, also as 
indicated in AppUcants' previous response, family A DNA Polymerases and their Klenow 
fragments were well known and characterized in the art at the filing of the present application 
{see, Patel, previously submitted). Thus, one of ordinary skill in the art would not doubt that the 
present inventors had possession of family A DNA polymerases other than Taq and E. coli DNA 
polymerases explicitly described in the present application. Furthermore, the present application 
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provides numerous exemplary substitutions of the QVH sequence in the C motif (see, the third 
foil paragraph on page 11) and experimental data showing that certain exemplary substitutions 
resulted in an enhanced mismatch discrimination activity of the modified DNA polymerases (see. 
Examples 3, 5 and 6, Figures 1-4). Accordingly, one of ordinary skill in the art would not doubt 
the possession of the family A DNA polymerase with an enhanced mismatch discrimination 
activity claimed in the present application by the present inventors at the time of the present 
application. 

In view of the above remarks, Applicants submit that this ground of rejection 
under 35 U.S.C. 112, first paragraph, has been overcome. Applicants respectfolly request that 
this rejection be withdrawn. 

Rejection Under 35 U.S.C. 1 12 (Enablement) 

Claims 1, 2, 14-27, 33 and 34 stand rejected under 35 U.S.C. 112, first paragraph, 
as not enabled. More specifically, it is asserted in the Office Action that the specification, while 
enabling for a family A DNA polymerase comprising the amino acid sequence of SEQ ID N0:2 
in which the amino acid residue at position Q879 has been replaced with a lipophilic amino acid 
residue, does not reasonably provide enablement for any family A DNA polymerases that have a 
modified motif C sequence and an enhanced mismatch discrimination as compared to a 
corresponding wild type polymerase or its Klenow fragment in which at least the amino acid 
residue Q879 in the motif C sequence QVH at positions 879-881 based on the E. coli DNA 
polymerase Klenow fi'agment has been replaced by a lipophilic amino acid residue. 

Applicants respectfiiUy traverse this ground of rejection. It seems that the non- 
enablement rejection in the Office Action is primarily based on the assertion that the present 
claims place minimal if any structural limits on the claimed modified polymerases. As discussed 
above. Applicants respectfiiUy disagree with this assertion. However, to facilitate allowance and 
without acquiescing to the rejection in the Office Action, Applicants have amended claim 1 to 
specifically recite that the family A DNA polymerase or its Klenow fragment comprises the A 
motif with the sequence DYSQIELR. Thus, the claimed DNA polymerase has the following two 
structural features: (1) it has the A motif with the sequence DYSQIELR in its active site, and (2) 
it has a modified motif C sequence in which at least the amino acid residue Q879 in the wild type 
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motif C sequence QVH in positions 879-881 of the E. coli DNA polymerase Klenow fragment 
has been replaced by a lipophilic amino acid residue. The first structural feature relates to the 
DNA polymerase activity of the claimed modified family A DNA polymerase, and the second 
structural feature relates to the enhanced mismatch discrimination activity of the claimed 
modified family A DNA polymerase. As discussed in detail below, one of ordinary skill in the 
art would know how to make (and use) the claimed DNA polymerase in view of the present 
application in combination with knowledge in the art available when the present application was 
filed. 

In the previous response. Applicants argued that one of ordinary skill in the art 
would know how to make (and use) a family A DNA polymerase based on the knowledge 
available at the filing of the present application. As discussed above, family A DNA 
polymerases were well known, and their structure-function relationship was well characterized. 
For example, as described in Patel, family A DNA polymerases have 6 conservative regions that 
form their active sites, and the tertiary structures of several family A DNA polymerases were 
characterized. In addition, function of individual amino acids was further studied, which shows 
that very few (<10) amino acid residues within the highly conserved motifs A, B and C have a 
direct role during nucleotide binding and incorporation, and only those residues that are 
important during catalysis and/or for protein folding need to be maintained, while all other 
residues are mutable {see, the Conclusions section on page 835, the first full paragraph in the 
right column on page 833 and the first paragraph in the right column on page 834, and the second 
paragraph in the left column on page 835). Thus, both the regions of family A DNA polymerases 
that may be modified without affecting polymerase activities and the great tolerance of this type 
of polymerase to modification were known in the art. In view of such knowledge in the art, one 
of ordinary skill in the art would be able to make family A DNA polymerases, including both 
naturally occurring family A DNA polymerases and their variants with retained DNA polymerase 
activity. 

In response, it is asserted that 

While one of ordinary skill in the art would know how to make (and use) a family 
A DNA polymerase based on the knowledge available at the filing of the present 
application, as discussed above, the breadth of the claimed mutant family A DNA 
polymerases is considerably broader. As discussed above those structural 
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limitations associated with a family a [sic] DNA polymerase are not necessarily 
associated with a mutant family A DNA polymerase. 

It is this breadth of the claimed mutant DNA polymerases and the lack of 
guidance associated with "enhanced mismatch discrimination activity" associated 
with such a broad genus of mutant DNA polymerases that results in the lack of 
scope of enablement. While applicants argue that those of skill in the art would 
know of certain associated structures or domains associated with the claimed 
DNA polymerases, such are not necessarily associated with the claimed mutant 
DNA polymerases. 

As indicated above, to facilitate allowance and without acquiescing to the 
rejection in the Office Action, Applicants have amended claim 1 to specify that the family A 
DNA polymerase or its Klenow fragment claimed in the present application comprise the A 
motif with the sequence DYSQIELR. Thus, the above-noted structural limitations associated 
with a family A DNA polymerase are associated with the family A DNA polymerase or its 
Klenow firagment claimed in the present application. 

In the previous response. Applicants further argued that the reference cited in the 
Office Action, Ngo, is insufficient to support the non-enablement rejection. First, this reference 
was published in 1994, 10 years earlier than the priority date of the present application. Thus, it 
does not describe the state of the art at the time of the present application. Second, Ngo states 
that it is not known whether there exists an efficient algorithm for predicting the structure of a 
given protein fi-om its amino acid sequence alone {see, second full paragraph on page 492). 
However, computer prediction via algorithms is not the only way that the structure of a given 
protein may be analyzed. As indicated in Patel, high-resolution crystal structures of several 
polykaryotic DNA polymerases have been determined, and the structure and fimction relationship 
of these enzymes have been characterized by various means, including sequence alignments, 
crystal structural analysis, and mutagenesis studies. Thus, Ngo does not specifically describe the 
state of the art regarding polykaryotic DNA polymerases. 

In response to Applicants' previous arguments, it is asserted in the Office Action: 

While applicants argue that the reference Ngo et al. is not representative of the 
skill in the art at the time of filing, and that much has been learned since the 
publishing of the reference Ngo et al., appUcants [sic] reference to the teaching of 
Patel et al. while helpfiil is not sufficient given the extreme breadth of the claimed 
genus of "mutant" DNA polymerases. 
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Again, as indicated above. Applicants have amended the claim to specify that the 
family A DNA polymerase or its Klenow fragment claimed in the present application comprise 
the A motif with the sequence DYSQIELR. Thus, the scope of the claims in the present 
application is not as broad as asserted in the Office Action. Applicants submit that the claims as 
currently pending are enabled by the present application in view of the advanced state of the art 
at the time of filing of the present application. 

In the previous response, Applicants also argued that one of ordinary skill in the 
art would also know how to modify a family A DNA polymerase to increase its mismatch 
discrimination activity in view of the present application. The present application provides that 
mismatch discrimination activity of a family A DNA polymerase may be enhanced by replacing 
Q879 in the motif C sequence QVH with a lipophilic amino acid residue, such as Gly, Ala, Val, 
Leu and He {see, e.g., second paragraph on page 5 and third full paragraph on page 11). Example 
1 of the present application teaches how to make and purify Klenow fragment variants with 
substitution in the motif C sequence QVH. Example 2 of the present application teaches how to 
screen the Klenow fi*agment variants for those with an enhanced mismatch discrimination 
activity. Examples 3, 5 and 6 demonstrate that exemplary variants containing substitutions in the 
motif C sequence QVH have enhanced mismatch discrimination activity in primer extension 
assays and real time PGR experiments. In view of the teachings of the present application, one 
skilled in the art would be able to make a family A DNA polymerase variant having Q879 
replaced with a lipophilic amino acid residue {see. Example 1) and determine its mismatch 
discrimination activity (see. Examples 2, 3, 5 and 6) without undue experimentation. 

In response to Applicants' previous arguments, it is asserted in the Office Action: 

Finally while one of skill in the art would know mechanistically how to modify a 
family A DNA polymerase to increase its mismatch discrimination activity in 
view of the present application, it remains that the art and application do not 
provide guidance at to the specific of exactly how to modify a family A 
polymerase to increase its mismatch discrimination activity. 

Applicants disagree with the above assertion. As indicated above, the present 

application provides that mismatch discrimination activity of a family A DNA polymerase may 

be enhanced by replacing Q879 in the motif C sequence QVH with a lipophilic amino acid 

residue, such as Gly, Ala, Val, Leu and He and demonstrates enhanced mismatch discrimination 
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activity of exemplary family A DNA polymerases. Applicants submit that contrary to the 
assertion in the Office Action, the present application provides specific guidance for one of 
ordinary skill in the art to make and use the claimed family A DNA polymerase without undue 
experimentation and thus enables the claims currently pending in the present application. 

In view of the above remarks, Applicants submit that this ground of rejection 
under 35 U.S.C. 112, first paragraph, has been overcome. Withdrawal of this rejection is 
respectfully requested. 

Applicants believe that the claims pending in the present application are now 
allowable. Favorite consideration and a Notice of Allowance are earnestly solicited. 

The Director is authorized to charge any additional fees due by way of this 
Amendment, or credit any overpayment, to our Deposit Accovint No. 19-1090. 

Respectfully submitted, 

SEED Intellectual Property Law Group pllc 

/Oing Lin/ 

Qing Lin, Ph.D. 
Registration No. 53,937 

QXL:kw 

Enclosure: Patel et al (J. Mol. Biol. 308:823-837, 2001) 

701 Fifth Avenue, Suite 5400 
Seattle, Washington 98104 
Phone: (206)622-4900 
Fax: (206)682-6031 
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Accurate transmission of DNA material from one generation to the next 
is crucial for prolonged cell survival. Following the discovery of DNA 
polymerse I in Escherichia coli, the DNA polymerase I class of enzymes 
has served as the prototype for studies on structural and biochemical 
mechanisms of DNA replication. Recently, a series of genomic, mutagen- 
esis and structural investigations have provided key iasights into how^ 
Pol I class of enzymes function and evolve. X-ray crystal structures of at 
least three Pol I class of erizymes have been solved in the presence of 
DNA and dNTP, thus allowing a detailed description of a productive 
replication complex. Rapid-quench stop-flow studies have helped define 
individual steps during nucleotide incorporation and conformational 
changes that are rate limiting during catalysis. Studies in our laboratory 
have generated large libraries of active mutant enzymes (8000) containing 
a variety of suKstitutions within the active site, some of which exhibit 
altered biochemical properties. Extensive genomic information of Pol I 
has recently become available, as over 50 polA genes from different pro- 
karyotic species have been sequenced. In light of these advancements, we 
review here the structure-fvmction relationships of Pol I, and we highlight 
those interactions that are responsible for the high fidelity of DNA syn- 
thesis. We present a mechanism for "flipping" of the complementary 
template base to enhance interactior\s with the incoming nucleotide sub- 
strate during DNA synthesis. 

C3 2001 Academic Press 
Keywords: DNA polymerase; structure-function; replication; active site; 
recombination 



Introduction 



Primary functions and sequence of prokaryotic 
DNA Pol I 

Prokaryotic species exist in diverse climates, ran- 
ging from radiation-resistant Deinococcus radiodur- 
ans (believed to have evolved one billion years 
ago; Battista, 1997), to thermostable Thermus aquati- 
cus (first isolated from hot springs in Yellow Stone 
National Park; Oiien et al, 1976) to enteric bacteria 
Escherichia coli, to obligate intracellular bacteria 



Abbreviations used: Pol I, polymerase I; RT, 
reverse transcriptase. 
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Chlamydia trachomatis. Despite this environmental 
diversity, prokaryotic DNA Pol I sequence has 
remained remarkably cor^erved through at least 
one billion years (Gutman et al, 1993; Joyce et al, 
1982). The gene encoding DNA polymerase I 
{pal A) contains approximately 3000 base-pairs that 
encodes approximately 1000 amino acid residues 
in a simple polypeptide chain. Organisms separ- 
ated by a billion years of evolution (e.g. Deinococ- 
cus-Therrnus genera and £. coli) have DNA Pol I 
groups that exhibit similar activities, retain ^2>5% 
amino acid identity and R=50 % homology. Interest- 
ingly, the lethal effects of deleting endogenous Pol 
I activity from a specific prokaryote can be comple- 
mented by Pol I from another organism. For 
example, Tatj Pol I can fully restore viability of 
£. coli lacking endogenous Pol I activity (Suzuki 
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et ai, 1996a; Patel & Loeb, 2000a), and E. coU Pol I 
can restore viability in D. radiodurms lacking 
endogenous Pol I (Gutman et ah, 1994). This 
interchangeability suggests that Pol I functions in 
similar pathways in diverse prokaryotes. 

Biochemical and genetic investigations in E. coli 
have contributed to our understanding of Pol I's 
role in vivo. E. coli Pol I, the first polymerase dis- 
covered, is the most abundant polymerase within 
its host (approximately 400 molecules per cell) and 
functions primarily to fill DNA gaps that arise 
during DNA repair, recombination and replication 
(Romberg & Baker, 1992). Prokaryotic Pol Is have 
two functional domains located on the same poly- 
peptide: a 5'-3' polymerase (located towards the C 
terminus) and 5'-3' exonuclease (located towards 
the N terminus) (Joyce & Grindley, 1984). E. coli 
Pol I contains in addition a "non-essential" proof- 
reading 3'-5' exonuclease domain which functions 
to excise DNA replication errors. DNA Pol Is from 
diverse species within the genera Rickettsia and 
Tliermus do not contain motifs Exol, ExoII, and 
ExoIII, which are necessary for 3'-5' exonuclease 
function. Pol 1 uses its 5' nuclease activity to 
remove the ribonucleotide portion of newly syn- 
thesized Okazaki fragments and DNA polymer^e 
activity to fill in the resulting gap. polA(-) E. coli 
join Okazaki fragments tenfold slower than wild- 
type bacteria, and are sensitive to DNA damage by 
UV damage and alkylating agents (Deluca & 
Cairns, 1969). During repair, Pol 1 fills in DNA 
gaps that result from the removal of a variety of 
DNA lesions (e.g. UV-induced thymidine dimer, 
the oxidative lesion 8-oxo guanine, and the alky- 
lation lesion 4-methyl adenine, etc.; for a review of 
repair mecharusms, see Friedberg et al., 1995). 

The Pol I family has served as a prototype for 
studying DNA polymerase mechanisms. There are 
at least six known families of DNA polymerases, 
which are grouped based on amino acid sequence 
homologies: A, B, X, RT, Pol III, and UmuC/DinB 
families. Family A polymerases are found primar- 
ily in organisms related to prokaryotes and include 
prokaryotic DNA polymerse I, mitochondrial poly- 
merase Y, and several bacteriophage polymerases 
including those from odd-numbered phage (T3, T5, 
and T7). Family B polymerases are present in bac- 
teriophages, viruses, archea and eukarotes. Many 
of these polymerses function to replicate the host 
genome, and include those from even-numbered 
phages T4 and T6, herpes viruses, archeal pol 
"Vent", and mammalian pols a, 5 and e. Family X, 
containing mammalian pol p, and n, function 
during DNA repair. The reverse transcriptase 
family contains RTs from retroviruses as well as 
eukaryotic telomerases (Lingner et al, 1997). The 
prokaryotic Pol III family encompasses the DNA 
pol5m\erases that replicate the majority of bacterial 
genomes. The recently discovered UmuC/DinB 
family includes pols t), i, k, and deoxycytidyl 
transferase (for a review, see Friedberg et al, 2000). 



Three-dimensional structures of the Pol I 
active site 

High-resolution crystal structures of DNA poly- 
merases in the Pol I family including Taq Pol I 
(Kim el al, 1995; Li et al, 1998), £. coU Pol I (Kle- 
now) (Beese et al, 1993a; OUis et al, 1985), Bacillus 
polymerase (Klenow-like fragment) (Kiefer et al, 
1998) and bacteriophage 17 DNA polymerase 
(Doublie et nl, 1998) have been determined (for a 
review, see Steitz, 1999). The X-ray crystal struc- 
tures of these polymerases resemble in overall mor- 
phology a cupped human right hand, with fingers 
(which bind an incoming nucleotide and interact 
with the single-stranded template), palm (which 
harbors the catalytic amino acid residues and also 
binds an incoming dNTP) and thiunb (which binds 
double-stranded DNA) subdomaias. Distantly 
related prokaryotic polymerases within the Pol I 
family such as Taq Pol I and E. coli Pol I exhibit 
less extensive homologies, nevertheless have three- 
dimensional structures that are virtually identical, 
with secondary structural elements, a-helices and 
p-strands, that are all nearly completely superim- 
posible (Figure 1). Tliese studies show extensive 
sequence heterogeneity between enzymes can 
produce identical folding patterns. The structural 
identity in turn suggests Pol I enzymes from 




Figure 1. Superposition of Taq Pol I (Li et al, 1998) 
and £. coli Pol I (Klenow fragment; Beese et nl., 1993a) 
crystal structures. Tnq Pol I (red) and £. coli Pol I (blue) 
are separated in evolution by one billion years. The 
structures of these two enzymes are superimposible, 
such that corresponding a-helices and (i-strands adopt 
nearly identical folding patterns (on average, <1 A 
difference). Conformation of the fingers subdomain 
(magenta and teal) differ slightly, suggesting this region 
is flexible. Superpostioning of other related Pol 1 class of 
enzymes yielded similar results. All structure ilius- 
tratioas were prepared by E. Adman using MOLSCRIPT 
(Kraulis, 1991) and RasterSD (Merritt & Murphy, 1994). 
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diverse organisms function to incorporate nucleo- 
tides by identical mechanisms. 

Amino acid sequence alignments of related 
enzymes can help identify important regions with 
common catalytic functions (Figure 2). Delarue et al. 
(1990) compared the C-terminal polymerase 
domains amongst distantly related members of the 
Pol I family of enzymes. Following alignment, five 
conserved regions (numbered 1-5) within diverse 
DNA Pol Is were delineated; in addition, we ident- 
ify a sixth conserved region (region 6). Three of 
these regions (3, 4, and 5) resembled the most 
hig^y conserved domains of mammalian DNA 
pol a in family B (termed motif A, B, and C; 
Figures 3 and 4). Analysis of high-resolution crys- 
tal structures of family A pol)anerases in complex 
with DNA and an incoming nucleotide suggests 



each of these six regions have an important role 
during DNA synthesis (Figure 2, region 5). Region 

1 amino acid residues are located at the tip of the 
thumb subdomatn and form a helix-loop that inter- 
acts with the minor groove of the double-stranded 
DNA during the nucleic acid binding step. Region 

2 amino acid residues are located within the palm 
subdomain and interact with the template strand 
(along the minor groove) and thus, form the "tem- 
plate grip". Region 6 amino acid residues interact 
with the first template base; as a result, this tem- 
plate could be "flipped out" of the helix axis by 
>90°, such that it cannot base-pair with the 
incoming dNTP (see below). Amino acid residues 
within regions 3, 4, and 5 correspond to motifs A, 
B, and C, respectively, and are located near the 
incoming nucleotide triphosphate and thus form 



Taq ! 444 MEATGVRLDVAYLRALSLEVAEEIARLEAEVFRIAGHPFNLNSRDQLERVL — -FDELG 

Bst I 488 MEFTGVKVDTKRLEQMGAEIjTEQLQAVERRIYELAGQEFNINSPKQLGTVL PDKLQ 

FN +S +++ L + + 

T7 . ******************* ***************«****pi,pssRDHiQKKLQEAGWVPTKYTDKG 



Taq : 500 LPAIGKTEKTGKR---STSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPLPDLIHPRTGRLH 

Bst : 544 LPVLKKT-KTGY STSADVLEKLAPHHEIVEHILHYRQLGKLQSTYIEGLLKWHPVTGKVH 

+P++ + G +++ + I + I + + K+ Y+ G++H 
T7 : 36X APWDDEVLEGVRVDDPEKQAAIDLIKEYLMIQKRIGQSAEGDKAWLRYV AEDGKIH 



Taq : 562 TRPNQTATATGRLSSSDPNLQNIP-VRTPLGQRIRRAFIAE EGWLLVALDYSQIELR 

Bst ! 604 TMFNQALTQTGRLSSVEPNLQNIP-IRLEEGRKIRQAFVPSE PDWLIPAADYSQIELR 

N TGR++ PNL IP +R G++ R AF W+ + +D+S +ELR 

T7 : 418 GSVNPNGAVTGRATHAFPNLAQIPCSVRSPYGEQCRAAFGAEHHLDGITGKPWVQAGIDASGLELR 



Taq : 618 VLAHr.SGDENLIRVFQE--GRDIHTETASWMFGVPREAVDPLMRRAAKTINFGVLYGMSAHRLSQ 
Bst : 661 VLAHIAEDDNLIEAPRR--GLDIHTKTAMDIFHVSEEDVTANMRRQAKAVNFGIVYGISDYGLAQ 
LAH + + + + DIHT+ + + +R+ AK +G LYG + Q 

T7 : 483 CLAHFMARPDNGEYAHEILNGDIHTKN QIAAELPTRDNAKTFIYGFLYGAGDEKIGQ 



Taq : 681 ELAIPYEEAQAPIERYPQSPPKV-— RAWIEKTL-EEGRRRGYVETLPGRRRYVPDLEARVKSVR 

Bst : 724 NLNITRKEAAEFIERYFASFPGV KQYMDNIV-QEAKQKGYVTTLLHRRRYLPDITSRNFNVR 

+ + ++ +++ p + + ++ + + + + +RR++ + R VR 

T7 : 540 IVGAGKERGKELKKKFLENTPAIAALRESIQQTLVESSQWVAGEQQVKWKRRWIKGLDGRKVHVR 



Taq I 742 EAAERMAFNMPVQGTAADLMKLAMVKLF-PRLEEMG-AR MLLQVHDELVLEA-PKER 

Bst : 785 SFAERTAMNTPIQGSAADIIKKRMIDLSVSVREERIjQAR LLLQVHDELILEA-PKEE 

+ A N+ +Q A + K ++ E ++ + VHDE+ + +E 

T7 : 605 --SPHAALNTLLQSAGALICKLWIIKTE-EMLVEKG-LKHGWDGDFAYMAWVHDEIQVGCRTEEI 



Taq : 796 AEAVARLAKEVMEGV-— YPLAVPLEVEVGIGEDWLSAK* 831 

Bst : 841 lERLCRLVPEVMEQA— -VALRVPLKVDYHYGPTWYDAK* 876 

++ + + + E+M+ + L+ + G W 

T7 : 666 AQWIETAQEAMRWVGDHTOFRCLLDTEGKMGPNWAICH* 700 

Figure 2. DNA polymerase I sequence alignments. Sequences of Taq Pol 1, 17 Pol and Bst Pol I (accession numbers 
70368, GI2126855) were obtained from GenBank and aligned using NCBI BLAST family of programs. Conserved 
amino add residues are listed, and conserved motife are higliiighted. Regions of insertion or deletions are marked by 
(-), and (*) indicates an inserted segment within T7 DNA polymerase that allows binding with an accessory protein 
and processivity factor, thioridoxin. 
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THERMUS AQUATICUS 
THERMUS THEFOVIOPHILUS 
THEfOlUS FILIFORMIS 
DEINOCOCCUS RADIODURANS 
ESCHERICHIA COLI 
HAEMOPHILUS INFLUENZAE 
STREPTOCOCUS PNEUMONIAE 
MYCOBACTERIUM TUBERCULOSIS 
MYCOBACTERIUM LAPRAE 
TREPONEMA PALLIDUM 
CHLAMYDIA TRACHOMATIS 
BORIEIA BURGDORFERI 
HELICOBACTER PYROLI 
LACTOCOCCUS LACTIS 
MYTHELOBACTERIUM 
RHODOTHERMUS OBAMENSIS 
RICKETTSIA PROWAZEKII 
STREPTOMYCES COELICOLOR 
BACILLUS STEAROTHERMOPHILUS 
SYNECHOCYSTIS SP 
AQUIFEX AEOLICUS 

APSE-1 DNA polymerase 
T7 DNA polymerase 
T5 DNA polymerase 

Concensus sequence 



MOTIF A 

LLVALDYSQIELR 
ALVALDYSQIELR 
LLLAADYSQIELR 
TLIAADYSQIELR 
VIVSADYSQIELR 
SIVAADYSQIELR 
VLLSSDYSQIELR 
ELMTADYSQIEMR 
ELMTADYSQIEMR 
ELISADYTQIELV 
YFLAADYSQIELR 
IFISADYSQIELA 
CLLGVDYSQIELR 
LLLSSDYSQIELR 
KLISADYSQIELR 
KLLSADYVQIELR 
KLISADYSQIELR 
SLMTADYSQIELR 
LIFAADYSQIELR 
LLVSADYSQIELR 
TFVISDFSQIELR 

klvisdlsniegr 
vqagidasglelr 
rviawdlttaevy 



MOTIF B 

RRAAKTINFGVLY 
RRAAKTVNFGVLY 
RRAAKTVNFGVLY 
RRAAKTVNFGVLY 
RRSAKAINFGLIY 
RRNAKAINFGLIY 
RRNAKAVNFGWY 
RRRVKAMSYGLAY 
RRRVKAMSYGLAY 
RRIAKTINFGIVY 
RYQAKAVNFGLVY 
RRIAKSINFGIIY 
RSIAKSINFGLVY 
RRNAKAVNFGWY 
RRRAKTINFGIIY 
RRRAKMVNYGIPY 
RRKAKAINFGIIY 
RRKIKAMSYGLAY 
RRQAKAVNFGIVY 
RNLGKTINFGVIY 
RQLAKAINFGLIY 



MOTIF C 

LLQVHDELVLE 
LLQVHDELLLE 
LLQVHDELVLE 
LLQVHDELLIE 
IMQVHDELVFE 
IMQVHDELVFE 
LLQVHDEIVLE 
LLQVHDELLFE 
LLQVHDELLFE 
LLQVHDELIFE 
LLQIHDELLFE 
LLQVHDEMLIE 
LLQVHDELIFE 
LLQVHDEIILD 
LLQVHDELVFE 
LLQVHDELVFE 
ILQIHDELLFE 
LLQVHDEIVLE 
LLQVHDELILE 
LLQVHDELIFE 
VNLVHDEIWE 



rqigkvmelglgy ivtvhdeiise 
rdnaktfiygf ly mawvhdeiqvg 
rqaakaitfgily vmlvhdswai 



xhhhhDhhxhEhx RpxxKxxxhGhhY hhxhHDxhxxx 



Figure 3. Pol I sequences of motifs A, B, and C. Sequences of prokaryotic Pol 1 (in capital letters) and bacteriophage 
Pols (in lowercase letters) were obtained hom GenBank and aligned using NCBI BLAST family of programs. The 
sequences of conserved motifs A, B, and C are shown. In tlie conseasus sequence (bottom), amino acids that are 
absolutely conseived are listed; h represents hydrophobic amino acid, and x represents diverse amino acids occupy a 
particular po.sition. 



portioas of the dNTP binding deft within the DNA 
polymerase active site. 

Motifs A and C are located within the palm sub- 
domain and are conserved in structure within 
families A, B, RT, and X polymerases, while motif 
B is located in the fingers subdomain. Structurally, 
motifs A, B and C of Taq DNA Pol I, E. coli Pol I, 
Bst Pol I, and 17 DNA Pol are nearly indistinguish- 
able (Figures 3 and 4). Motif A begins at an anti- 
parallel p-strand containing predominaiitly 
hydrophobic residues and continues to an a-helix. 
Amino acid sequences of a large variety of natu- 
rally occurring prokaryotic DNA Pol Is (Figures 3 
and 4) show many sequences can produce the 
"antiparallel P-strand" portion of motif A; how- 
ever, the "turn and a-helix" portions consistently 
contain the "DYSQIELR" amino acid sequence. 
Motif B constitutes an a-helix located in the fingers 
subdomain and is found in both A and B families 
of DNA polymerases. DNA Pol I sequence align- 
ment indicates motif B contains a consensus 
sequence RRxhKhhNFGhhY, where h represents 
hydrophobic amino acid and x indicates any 
amino acid. Structurally, motif C amino acids form 
two antiparallel p-strands. While diverse amino 
acid sequences form the antiparallel P-strands, the 
loop between these strands invariably contains the 
sequence "HDE". Comparison of DNA polymerase 



sequences from bacteriophage, which evolve at 
very high rates, with that of prokaryotic Pol Is 
shows that few active site anuno acid residues are 
conserved between these groups. For example, 
only motif A amino acid residues Asp610 and 
GIu625 and motif B residues Arg659, Lys663, 
Gly668, and Tyr671, and motif C amino acid resi- 
dues His784 and Asp785 are entirely conserved, 
(All amino acid numbering is based on the Taq Pol 
I sequence.) Sequence alignments between distant 
family members indicate only a few amino acid 
residues within the polymerase active site are con- 
served in the Pol I family of enzymes; however, a 
comparison of individual Pol I sequences to crystal 
structures indicates that a significant level of 
sequence heterogeneity (>50 %) can yield structural 
identity. To further understand the potential roles 
of active site amino acid residues and gain insights 
into DNA polymerization mechanisms, we will 
examine structures of Pol I complexed with diverse 
substrates encountered during DNA replication. 

Pathway of DNA Synthesis 

The rates and steps involved during single 
nucleotide incorporation reactions have been deter- 
mined for many polymerases, including those in 
the Pol I class (e.g. Taq Pol I (Brandis el al, 1996), 
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T7 DNA Pol (Patel et al, 1991), and E. coli Pol I 
(Bryant et al, 1983; Kuchta et al, 1988)). There 
emerges a general mechanism for single nucleotide 
additions: 

dm? ^ 

E+TP^E-TP^E-TP-dNTP 
1 2 



the tip of the thumb domain (helices HI and H2; 
region 1; Figures 2 and 4) rotate in the opposite 
direction relative to the rest of the thumb such that 
the tip is in proximity to the DNA. Together, these 




(1) 



During DNA synthesis (equation (1)): 1, poly- 
merase (E) binds with template-primer (TP); 2, the 
appropriate dNTP binds with polymerase-DNA 
complex; 3, a nucleophilic attack results in phos- 
phodiester bond formation; and 4, pyrophosphate 
(PPi) is released. Rapid quench kinetic experiments 
indicate that DNA binding (1) and nucleotide bind- 
ing (2) occur very rapidly. The rate-limiting step is 
either phosphodiester bond formation or a confor- 
mational change that precedes nucleotide incorpor- 
ation, The incorporation of complementary 
nucleotides involves dynamic interactions between 
the polymerase with its nucleic acid and dNTP 
substrates. Polymerases are proposed to undergo 
at least four significant conformational changes: (i) 
during the DNA binding step; (ii) subsequent to 
the dNTP binding step and immediately preceding 
chemical catalysis; (iii) subsequent to nucleotide 
incorporation during PPi release; and during (iv) 
translocation towards the new primer 3'-OH termi- 
nus. Theoretical studies indicate that multiple con- 
formational changes contribute to the fidelity of 
catalysis by DNA polymerases (Beckman & Loeb, 
1993). High resolution crystal structures exist 
within the Pol I family of enzymes with all of these 
intermediates: (1) without DNA and dNTP {Taq 
Pol I, E. coli Pol I (Klenow)); (2) with DNA in the 
active site but without dNTP (Tfl(j Pol I); (3) with 
DNA and dNTP (Jaq Pol I and T7 pol); (4) with 
pyrophosphate (Klenow)(Beese et al, 1993b); and 
(5) translocated following dNTP addition {Est Pol 
I). Of these crystal struchires, Taq Pol l:DNA:dNTP 
(Li et al, 1998), T7:DNA:dNTP (Doublie et al, 1998) 
and J3sf Pol:DNA:dNTP (Kiefer et al, 1998) are par- 
ticularly revealing, as each of these high-resolution 
structures depicts important steps during catalysis. 

Binding of polymerase to DNA 

The first step in polymerization involves the 
association of the polymerase with the template- 
primer. Comparisons amongst the crystal struc- 
tures of Taq Pol 1, Est Pol I, and E, coli Pol I in com- 
plex with DNA show that the thumb subdomain 
changes conformation to nearly completely wrap 
around the DNA. Two significant enzyme confor- 
mational changes occur within Taq Pol I thumb 
subdomain (Li et al, 1998): (1) the thumb subdo- 
main rotates towards the palm subdomain; and (2) 
the conserved amino acid residues located within 



changes result in an approximately 30 A wide 
cylinder that almost completely engulfs the DNA, 
such that conserved amino acid residues within the 
tip of the thumb subdomain (region 1) "grip" the 
DNA along the minor groove. A conformation 
change wiSiin the thumb subdomain following 
DNA binding has also been reported for HIV-1 RT 
(Patel et al, 1995). In each of the diverse polymera- 
se:DNA high-resolution complexes studied, the 
enzyme interacts primarily with the sugar-phos- 
phate DNA backbone along the minor groove, and 
these interactions are associated with bending of 
the DNA such that it adopts an S-shaped confor- 
mation. Taq Pol I interactions with the thumb sub- 
domain along the minor groove result in bending 
double-stranded DNA three bases from the primer 
terminus, causing the DNA near tlie active site to 
adopt an >l-like conformation (Figures 4 and 5). A 
second set of interactions, where the single strand 
template interacts with the palm subdomain (by 
region 2 or "template grip" amino acid residues), 
also contributes to this bend. A third set of poly- 
merase:DNA interactions, results in a >90° 
rotation of the first template base and a 180° 
rotation of the adjacent template bases, such that 
the template bases are flipped outside the DNA 
helbc axis and away from the dNTP binding site 
(Figures 4 and 5). This unusual S-shaped DNA 
conformation, in which the first bend is induced by 
interactions with the thumb and palm subdomains 
and the second bend is induced by portions of the 
active site interacting with the single-strand tem- 
plate, is also seen in high-resolution structures of 
Est Pol I, T7 pol, and HIV-1 RT bound to DNA 
(Figure 4), Thus, this DNA conformation, including 
template base initially flipped out of the helix axis 
by >90°, may have a general relevance to the 
DNA polymerization mechanism by a wide variety 
of enzymes. 

Nucleotide binding 

A second conformation change within poly- 
merases occurs during the dNTP-binding step. 
High-resolution crystal structures of Taq Pol I (Li 
et al, 1998), T7 DNA Pol (Doublie et al, 1998), 
HIV-1 RT (Huang et al, 1998), and Est Pol I (Kiefer 
et al, 1998) bound to DNA in the presence of an 
incoming nucleotide suggest at least three events 
occur during the dNTP binding step (Figures 5 and 
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Figure 4. Active sites of Bst Pol I (Kiefer et al. (1998), panel 1), Taq Pol I (Li et al. (1998), panel 2), and T7 Pol 
(Doublie et nl. (1998), panel 3). The location of motifs A (red), B (green) and C (blue), along witli other secondary 
structui-al elements are nearly identical within these diverse polymerases. In addition, binding to DNA by each of 
these polymerases distorts the conformation of the nucleic add. Notably, the first template bases are "flipped" out of 
the helix axis. Bacillus polymerase {Bst Pol 1; 4bdp) depicts conformation of the protein following nucleotide incorpor- 
ation; thus, the protein is bound to DNA but not an incoming nucleotide. In this state, the polymerase is in an "open 
conformation" with the fingers subdomain (motif B) located away from the catalytic triad residues located in the 
palm subdomain (motifs A and C). The Taq Pol I (3ktq) and T7 pol (lt7p) structures show polymerases bound to 
DNA and an incoming nucleotide. In this state, polymerases adopt a "closed" conformation, with fingers subdomain 
motif B elements located near the palm subdomam. 



6). We propose all three of the following steps are 
central to an "induced-fit" model for nucleotide 
incorporation. 

(1) Structural elements within the fingers domain 
rotate towards the 3' primer terminiK, resulting in 
a "closed" structure. 

(2) The template base rotates back into the helix 
axis (by ^90°). 

(3) The base portion of the incoming nucleotide 
forms a Watson-Crick base-pair with tiie template 
base and ihe triphosphate portion forms metal- 
mediated ionic interactions with amino acid resi- 
dues of the active site. 

Four high resolution DNA polymerase structures 
of the ternary enzyme:DNA:dNTP complex, two in 
an "open conformation" (Bst Pol I and Taq Pol I) 
and two in a "closed" conformation (T7 pol and 
Taq Pol I) provide valuable insights into the struc- 
tural basis for nucleotide incorporation. Two 
unique conformations of Taq Pol I:DNA:dNTP 
complex were discovered by Waksman and col- 
leagues, following crystallization experiments with 



varying amounts of the incoming nucleotide. In 
the presence of a low concentration of the incom- 
ing nucleotide, the polymerase is in an open con- 
formation (Figures 5(a) and 6(a)), and the Taq Pol I 
struchire (Li et al, 1998) is identical to that of Bst 
Pol I (which depicts the conformation of polymer- 
ase following dNTP incorporation and poised for 
the next round of synthesis (Kiefer et nl, 1998)). In 
both of these structures, the first template base is 
flipped out >90° away from the helix axis, and a 
highly conserved motif B tyrosine residue (Tyr671 
of Taq Pol I and Tyr714 of Bst Pol I; Figure 7) is 
positioned exactly where the template base would 
be expected, on top of the first base-pair of the 
double-stranded DNA. Thus, Tyr671 maintains 
stacking interactions expected of the template 
base. In this open conformation of the Taq 
Pol:DNA:dNTP complex, the triphosphate portior« 
are coordinated by Asp610 and Asp785, and the 
base portion of the incoming dNTP lies in a solvent 
exposed pocket and stacks on top of the first base- 
pair of the dsDNA, adjacent to T5rr671. Watson- 
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Open Closed 
Taq pol I 



Figure 5. Location of conserved regions witlTin Taq Pol I. Six conserved regions witliin DNA Pol I are shown in 
"open" (a) and "closed" (b) confomaations of Taq Pol I. Region 1 (light green) is located within the thumb subdomain 
and interacts with the DNA minor groove. Region 2 (orange) is located within the palm subdomain and forms the 
"template grip" tliat interacts with the DNA near the minor groove. Motif A (red) interacts witli an incoming diva- 
lent cation-dNTP complex. Motif B (dark green) also interacts with the incoming dNTP, but is located in the fingers 
subdomain and changes conformation to adopt a "closed" form during the nucleotide binding step. Motif C (blue), 
along with motif A, is located in the palm subdomaiii; both of these motifs harbor the catalytic triad carboxylate con- 
taining amino acids. Region 6 (green-yellow) is located in the palm subdomain and interacts with the template strand 
near the site of "base flipping". The primer strand is shown in yellow, the template strand is in cyan, the flipped out 
bases are colored in purple, and the incommg ddNTP is colored in magenta. 



Crick base-pairing interactions are not observed 
initially, as the template base is flipped out. 

At a higher concentration of the incoming 
nucleotide, the polymerase adopts a "closed" state 
(Li et at, 1998) (Figures 5(b) and 6(b)). In this con- 
formation: (a) the template base is "flipped" ^90° 
into the helix axis; (b) there is a Watson-Crick 
base-pairing with the inconung nucleotide; (c) 
Phe667 of O-helix forms stacking interaction with 
the incoming dNTP; and (d) the fingers subdomain 
changes conformation to almost completely engulf 
the incoming nucleotide substrate. This confor- 
mational change affects significantly the position of 
motif B amino acid residue (O-helix) located 
within the fingers subdomain, and occurs concomi- 
tantly with a 40" rotation of the O-helix towards 
the catalytic site amino acid residues Asp610 and 
Asp785. Following these conformation changes, the 
base portion of the incoming nucleotide packs 
against two hydrophobic planar amino acid resi- 
dues (Tyr671 and Phe667) located in motif B, and 
the ribose portion of the nucleotide packs against 
Ile614 and the aliphatic portion of Glu615. Thus, a 
hydrophobic pocket surrounds the base and ribose 
portions of the incoming dNTP. The negatively 
charged triphosphate portion of the dNTP, follow- 
ing the conformational change, interacts with two 
basic (positive charged) amino acid residues 
(Lys663 or Arg659) in motif B. Two acidic side- 
chains, Asp610 and Asp785, in motif A and C, 



respectively, also participate in metal-mediated 
interactions with the triphosphate group; these 
amino acid residues are directly involved in stabi- 
lizing the transition state during bond formation. 

An induced-fit model for nucleotide incorpor- 
ation should explain how the following three inter- 
actions with the incoming nucleotide are formed 
during dNTP binding: (1) hydrogen bonding with 
the template base; (2) stacking interactions with 
planar ringed amino acid residues; and (3) electro- 
static interactions with negatively charged phos- 
phate groups and charged side-chains. The Taq Pol 
I "open" and "closed" structures suggest that the 
triphosphate portion of the incoming dNTP 
initially binds near the catalytic residues Asp610 
and Asp785 via metal coordination. We propose 
protein:dNTP-stacking interaction occurs as the 
O-helix Phe667 residue interacts with the base 
position of the incoming nucleotide. Development 
of this stacking interaction facilitates movement of 
the O-helix (motif B) towards motif A amino acid 
residues and displaces Tyr671, which in turn 
allows the template to rotate >90° back into the 
helix axis. If the incoming nucleotide is comp- 
lementary to tlie base, then hydrogen bonding 
with the template base would further stabilize the 
dNTP. Stable stacking and base-pairing inter- 
actions would provide time for the tip of the finger 
(O-helix) to come in close proximity with the palm 
subdomain. This conformational change results in 
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Open (low [dCTFj) Closed (high [ddCTP]) 



Taq pol I 

Figure 6. Coiiformational changes during dNTP binding. The Figure iUustrates some of tlie major interactions and 
conformation changes during tlie dNTP-binding step. Initially, during dNTP binding (magenta), the template base 
(purple) is rotated out of the helix axis, thus cannot base-pair with the incoming nucleotide. The expected position of 
the template is taken up instead by a highly conserved Tyr (Y671) residue, which hydrogen bonds with the dNTP. In 
addition, motif B (O-helix; green) adopts an open conformation. Following the completion of the dNTP binding step; 
(1) conformation change by O-helix displaces Tyr671 from stacking with the base-pair; (2) the template base rotates 
into the helix axis; and (3) Watson-Crick base-pairing occurs. The conformation change by O-helix results in a gener- 
ally hydrophobic pocket into which the base and ribo.se moiety bmd and a hydrophilic pocket, which houses the tri- 
phosphate portion of the incoming dNTP. Following the conformation change the pam-OH group of Tyr671 
hydrogen bonds with the carboxylate of Glu615. In addition, tliis conformation change serves to bring the dNTP 
a-phosphate close to the primer 3'-OH group. The conformation of motif A (red), and C (blue) elements, as well as 
region 6 elements (light green) do not change significantly during the dNTP binding step. 



hydrophobic interactions with the base and ribose 
portions of the incoming dNTP and hydrophilic 
interactions with the triphosphate group. Overall, 
the induced-fit model for nucleotide incorporation 
involving a conformation change of the O-helix 
and template rotation of >90° allows establish- 
ment of stacking interactions and serves to bring 
the dNTP a-phosphate close to the primer 3' -OH 
group, thus promoting chemical catalysis. This 
induced-fit mecharusm for nucleotide selection 
would also restrict conformations and structures of 
the incoming nucleotide, promoting the efficiency 
of correct nucleotide incorporation. In summary, 
we propose that stable stacking interactions with 
the new base-pair, in conjunction with strong elec- 
trostatic interaction brought about by the move- 
ment of the tip of the O-helix into dose proximity 
to the palm, results in a conformation that 
enhances base selection by DNA polymerases. 



Role of the "base-flipping" mechanism 

Crystal structures of several enzymes that pro- 
cess either normal or damaged bases (including 
glycosylases and methyl transferases) function by 



first rotating the designated base out of the DNA 
helix axis (Roberts, 1995). This base flipping mech- 
anism involves a rotation around the phosphodie- 
ster bonds so that the base is flipped 180° out of 
helix axis and into the active site of the enzyme. 
All of the high-resolution DNA polymerase:DNA 
crystal structures indicate that the first template 
base is rotated out by ^90° and the second tem- 
plate base is rotated by 180" (Figures 4-7). It is cur- 
rently unknown whether this rotation in diverse 
DNA metabolizing enzymes involves an active 
mechanism that rotates the sugar phosphate back- 
bone, or whether this rotation out of the helix axis 
occurs by passive diffusion and the rotated confor- 
mation is stabilized by specific active site amino 
acid residues. Within diverse Pol I structures com- 
plexed with DNA, amuio acid residues within 
helix Q (Region 6 amino acids, specifically in Taq 
Pol Arg746 to Asp759) interact with the sugar- 
phosphate backbone near the first template base. 
The flipped out first template base is initially 
stabilized by stacking interactions between helices 
O and Ol, and flipped out position of the second 
template base stacks with His676 of helix 01 
within Taq Pol I (and Tyr719 of Bst Pol I). 
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Bst pol I Taq pol I 



Figure 7. Conformation of the translocated coniplex. Bst Pol I complex (a) depicts the conformation of Pol I follow- 
ing nucleotide incorporation and translocation. The 0-heiix in this structure adopts an open conformation, Tyr714 
(equivalent to Y671 of Taq) stacks with the newly synthesized base-pair, and the template base (purple) is rotated out 
of the helix axis. Faint electron density appears for the resulting pyrophosphate group near the basic O-helix residues. 
The overall corrformation of Bst Pol I resembles that of Taq Pol I complexed with DNA in the absence of dNTP (b). 



A mechanism for "base flipping" during DNA 
polymerization has several interesting implications. 
Such a mecharusm could prevent —1 frame shift 
errors by hindering base-pairing to upstream 
templates, and thus enhance the fidelity of DNA 
replication. During the dNTP-binding step, the 
"flipping-in" mechanism allows establishment of 
stable stacking interactions betv^een the new base- 
pair and planar side-chains within the polymerase 
active site (Tyr671 and Phe667). These stacking 
interactions help form a hydrophobic pocket that 
surrounds the base and ribose portions of the 
incoming dNTP and the template base. Thus, these 
base-stacking interactions contribute to the DNA 
polymerase base-pairing fidelity by allowing the 
enzyme to form a pocket that can recognize the 
shape of the incoming new base-pair, prior to the 
nucleotide incorporation step. Lastly, the torsion 
created by twisting the single-stranded template 
strand could be important for translocation (see 
below). 

Nucleophilic attack 

The catalytic site of diverse DNA polymerases is 
composed of three acidic amino acid residues with 
carboxylate side-chains (in Taq Pol I: Asp610, 
Asp785, and Glu786). The ternary closed Taq Pol I: 
DNA: dNTP structure shows that two of the car- 
boxylate side-chains (Asp610 and Asp785) bind 
two metal ions (Li et ai, 1998); the precise role of 
the third carboxylate amino acid residues is 
unclear. Each metal (metal A and B; Figure 6(b)) is 



octahedrally coordinated and contains six ligands, 
four in one plane and one on each side of the 
plane. Metal A is coordinated by 3' OH, the a- 
phosphate group, oxygen atoms of Asp610 and 
Asp785 side-chains, and two water molecules. 
Metal B is coordinated by: a-, (3- and y-phosphate 
groups, carboxylates of Asp610 and Asp785, and 
the carbonyl of Tyr611. The two metal sites are 
3.8 A apart, and each ligand is separated by 2.2 A, 
a coordination symmetry consistent with octa- 
hedral coordination observed in many protein: 
dNTP structures. A similar two-metal coordination 
scheme also was observed in Pol P:DNA:dNTP 
(Pelletier et al, 1994), T7 DNA polymerase: 
DNA:dNTP (Doublie et al, 1998) and HP/ 
RT:DNA:dNTP (Huang et al, 1998) structures. The 
interaction of metal A with the a-phosphate is 
thought to enhance the electropositive character of 
the phosphorous atom; in addition, metal A inter- 
action with primer 3' -OH is thought to facilitate 
deprotonation of the 3' -OH group (Brautigam & 
Steitz, 1998). Metal A coordination geometry 
brings the two reacting groups (the nucleophile 3'- 
OH and a-phosphate) in close proximity and facili- 
tates the phosphoryl transfer reaction. 

Pyrophosphate release and translocation 

The phosphoryl transfer reaction leaves, as pro- 
ducts, a primer elongated by a single nucleotide 
and a pyrophosphate group. Analysis of an 
"open" £. coli Pol I (Klenow) bound to PPi (Beese 
et al., 1993b) and several "closed" Pol I structures 
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indicates that following nucleotide incorporation, 
the polymerases adopt an "open" conformation, 
translocate to next template position, and release 
PPi prior to beginning the next cycle of nucleotide 
incorporation. All Pol I class of enzymes not 
bound to both DNA and dNTP are in an "open" 
conformation. Within the "open" E. coli Pol I:PPi 
complex, PPi is complexed with the conserved Lys 
and Arg (within O-helix) motif B amino acid resi- 
dues. Within the Taq Pol I:DNA:dNTP and T7 
pol:DNA:dNTP "closed" complexes, the p and 
7-phosphates of ttie dNTP interact with the O-helix 
coraerved side-chains Arg659 and Lys663 and 
motif A amino acid residues Asp610 and Asp785, 
and the base and ribose portions of dNTP are 
bound in a hydrophobic pocket (Figure 6(b)). Thus, 
during the transition from a "closed" to "open" 
conformation, O-helix amino acid residues Arg659 
and Lys663 can function to remove PPi away from 
the catalytic site. What causes the polymerase to 
adopt an "open" conformation? We propose con- 
formational change into an "open" state is 
accompanied by a disruption of the stackit\g inter- 
actioiw following nucleotide incorporation and 
translocation. Consistent with this hypothesis, the 
structure of the Bst Pol LDNA shows, following 
nucleotide incorporation and translocation, the 
enzyme is in an "open" conformation and there is 
no pocket that engulfs the incorporated nucleotide. 
In this structure, the PPi group has diffused away 
from the active site (Figure 7(a)). 

Many of the Pol LDNA interactions by region 1 
and 2 amino acid residues involve the sugar-phos- 
phate backbone along the minor groove. These 
non-specific interactions allow polymerase (or the 
DNA) to translocate freely in a spiral motion. Fol- 
lowing nucleotide incorporation and translocation, 
the same interactions of the polymerase active site 
would need to be reformed at the new primer ter- 
minus. Thus, translocation needs to be halted 
when the primer 3'OH group is near the active 
site. Tyr714 within the translocated Bst Pol I:DNA 
complex (ryr671 of Tat^ Pol I) projects into the 
DNA and stacks over the template base of the 
newly formed base-pair (Figure 7(a)). Following 
translocation, this interaction positions the primer: 
template at the catalytic site. The strain created by 
the DNA helix distortion (i.e. A-ioim DNA at the 
active site and 90-180° base flipping of the tem- 
plate strand) may help guide the direction of trans- 
location (5'-3') with respect to the primer. One can 
speculate that other DNA metabolizing enzymes, 
including helicases, may adopt a base flipping 
mechanism to simultaneously eliminate base-pair- 
ing and stacking interactions within the double 
helix during DNA unwinding. In addition, the dis- 
tortion created by rotating the sugar phosphate 
backbone may guide the direction of DNA 
unwinding. 



Function of Individual Amino Acid 
Residues in Polymerization 

Structural data indicate that fewer than 10 out of 
over 50 amino acid residues in conserved motifs 
have direct interactions with the incoming dNTP 
substrate. For example, essential putative functions 
can be ascribed to amino acid residues Asp610 and 
Glu615 (of motif A), Tyr671, Phe667, Lys663, and 
Arg659 (of motif B), Asp785 and Glu786 (of motif 
C); however, the precise roles of all other amino 
add residues within these conserved motifs are 
unclear. The function of different amino acid resi- 
dues and side-chains in catalysis can be gleaned 
from a combination of kinetic measurements, site- 
specific mutagenesis, and random mutagenesis 
coupled with genetic selection. Site-specific muta- 
genesis has the advantage that one can assign a 
particular change in kinetics or substrate specificity 
to a distinct amino acid substitution. However, this 
approach is frequently limited in that catalytic effi- 
ciency of the enzyme is compromized, and the 
effects observed might reflect a global change in 
enzyme structure. Using random mutagenesis with 
positive genetic selection, one obtains mutant 
enzymes ffiat exhibit catalytic efficiencies similar to 
the wild-type. However, many of the enzymes con- 
tain multiple substitutions that militate against the 
direct assignment of altered catalytic properties to 
specific substitutions. As a result, a combination of 
these approaches offers many advantages in struc- 
ture-function studies. In considering the result so 
far obtained, we have chosen to analyze substi- 
tutions that occur in different motifs with the 
understanding that this simplification does not 
adequately consider distant interactions of amino 
acid substitutions. 

Extensive number of site-directed mutagenesis 
studies, yielding enzymes with single amino acid 
substitutions have been done with the Klenow 
fragment, Pol |3, and HIV RT. Kinetic analysis of 
these polymerases, which have a conserved amino 
acid residue substituted by a neutral residue, show 
these mutants frequentiy have low activity, and 
some mutants have a unique substrate specificity. 
For example, Joyce and colleagues (Minnick d ai, 
1999) replaced 28 conserved side-chains individu- 
ally within the E. coli Pol I (Klenow fragment) 
active site to alanine (or, in one example, to leu- 
cine), and tested each of the mutants for activity 
and fidelity. In these experiments, the majority of 
mutants had a 5- to 100-fold reduction in activity 
relative to wild-type using gapped DNA as a tem- 
plate. Only four of the 28 mutants tested exhibited 
low fidelity. Specifically, E710A within motif A 
(equivalent to E615A in Taq Pol I), Y766A (Y671A) 
within motif B, Arg668 (Arg573) and Arg682 
(Arg587) within region 2, and N845A (Asn750) of 
region 6 have low fidelity. These results suggest 
fidelity is determined by very few conserved 
amino add interactions. In addition, a complete 
deletion of region 1 amino acid residues (located at 
the tip of the thumb subdomain and binds within 
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the minor groove of the DNA duplex) results in 
polymerases with low activity, processivity, and 
high propensity for +1 frame-shift errors (Minnick 
et aJ., 1996). Together, these site-directed mutagen- 
esis and deletion mutagenesis studies have verified 
that many substitutions of conserved amino acid 
residues lower enzyme activity, and in some cases, 
alter substrate specificity. 

Applied molecular evolution of the polymerase 
active site 

To determine the contribution of specific active 
site residues to polymerase function and to evalu- 
ate the degree of plasticity within the polymerase 
active site in vivo, we randomly mutated 13 resi- 
dues each within the two most highly conserved 
regions within DNA polymerases: motif A (amino 
acid residues 605-617 in Taq Pol I (Patel & Loeb, 
2000a)) and motif B (amino acid residues 659-671 
in Taq Pol I (Suzuki et ai, 1996b)). Briefly, these 
experiments involved replacement of nucleotide 
sequence encoding amino acid residues with a par- 
tially randomized sequence, such that each amino 
acid can be altered to potentially any of the other 
19. When coupled with a stringent selection 
scheme, one can determine the nature of allowable 
amino acid substitutions in vivo after sequencing 
selected mutants. Functional mutants were selected 
by complementation of E. coU recA718 polA12, a 
strain that contains a temperature-sensitive 
mutation in the polA gene and can be propagated 
at 30 °C, but not at 37 °C (Sweasy & Loeb, 1992; 
Uyemura & Lehman, 1976). Taq Pol I can fully 
restore the temperature-sensitive phenotype such 
that E. coli recA718 polAU harboring Taq Pol I exhi- 
bits a 100% survival rate at 37 °C relative to 30 °C 
(Patel & Loeb, 2000a). 

Following transformation of the motif A library 
into Pol I deficient E. coli, we obtained 8000 active 
mutants, of which 291 were characterized for 
activity, fidelity and ability to incorporate ribonu- 
cleotides (Patel & Loeb, 2000a). Sequence analysis 
of all 291 selected active Taq Pol I clones indicated 
that 8 out of 13 motif A amino acid residues toler- 
ate a wide spectrum of substitutions, four amino 
acid residues tolerate conservative substitutions 
and only one amino acid residue (Asp610) is 
immutable. Comparison of a high-resolution Taq 
Pol I boimd to DNA and dNTP complex shows 
that the immutable residue (Asp610) functions to 
coordinate the metal-mediated catalysis reaction, 
leading to the incorporation of the incoming 
nucleotide. Those residues that tolerate predomi- 
nantly conservative substitutions stabilize tertiary 
structure. For example, Glu615, which hydrogen 
bonds with O-helix side-chain Tyr617, can only be 
substituted by an Asp, and Tyr611, wliich projects 
into a hydrophobic pocket, can only be replaced by 
other planar amino acid residues. Most other motif 
A amino acid residues can be substituted to a wide 
range of amino acids that differ in size, shape and 
charge, frequently without compromising activity. 



Interestingly, the conserved amino acid residues 
from our random mutagenesis experiments corre- 
late well with motif A consensus sequence found 
in nature (Figure 3). 

Transformation into £. coli recA718 polAll of 
motif B library containing random substitutions at 
amino acid residues 659-671 also yielded an exten- 
sive library of highly active DNA polymerases that 
contained diverse amino acid substitutions (Suzuki 
et al., 1996b). Sequence analysis of the selected 
active clones showed only two of the 13 amino 
acid residues are immutable (Arg659 and Lys663), 
two tolerate conservative substitutions (Phe667 
and Tyr671), and all others tolerate a wide range 
of substitutions. Structural analysis shows the 
immutable and conservatively substitutable amino 
acid residues interact with the incoming dNTP. 
These results are largely consistent with sequence 
aligrunent data, which show within prokaryotic 
and bacteriophage Pol I polymerases only motif B 
amino acid residues Arg659, Lys663, Gly668, and 
Tyr671 are conserved, and Phe667 tolerates only 
conservative substitutions. The random mutagen- 
esis data suggest only 3 out of 26 active site (motif 
A and B) residues analyzed are required for func- 
tion in vivo, six can tolerate only conservative sub- 
stitutions, and all other residues are highly 
substitutable. This inherent plasticity of the Taq Pol 
I active site is consistent with Pol I consensus 
sequence found in r\ature (Figure 3). Taken 
together, these data indicate that the DNA poly- 
merase active site is highly mutable. 

Altered substrate specificity 

DNA polymerases have an active site architec- 
ture that must specifically configure to and incor- 
porate each of the four deoxynucleoside 
triphosphates, while taking direction from tem- 
plates with diverse nucleotide sequences. In 
addition, the active site must exclude altered 
nucleotides produced during cellular metabolism, 
Tlie amino acid residues of motifs A and B 
are positioned to have a potentially important 
contribution toward DNA polymerase fidelity 
and substrate specificity. The effect of amino 
acid substitutions within the active site of Taq 
DNA polymerase was analyzed using over 300 
active mutants. Mutant libraries were tested 
for base-pair fidelity and for the incorporation of 
ribonucleotides. 

Base-pairing fidelity 

The conformational changes at the catalytic site 
of DNA polymerases that guarantee the high fide- 
lity of nucleotide incorporation have not been ade- 
quately established. These changes must 
acconunodate each of the complementary, yet 
structurally different base-pairings. One model that 
addresses the fidelity requirements is that during 
nucleotide incorporation, stable base stacking inter- 
actions and base-pairing allow time for a confor- 
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mational change, which brings the a-phosphate 
towards the primer 3'-0H (Patel et al, 1995). If a 
non-complementary nucleotide were to bind, then 
the base-pairing and stacking interacting would be 
disrupted, and as a result, the conformational 
change would be retarded. In this model, low fide- 
lity can be conferred by mutations that interfere 
with the transitions between "open" and "closed" 
conformations. Two types of mutations reduce 
fidelity: (1) mutations that stabilize the closed con- 
formation, even in the absence of proper base-pair- 
ing and stacking; and (2) mutations that "widen" 
the hydrophobic pocket to accommodate binding 
of non-complementary nucleotides. Substitutions 
within the O-helix that confer low fidelity include 
Ala661Glu and Thr664Arg (Suzuki el al, 1997, 
2000). These residues are located near the distal 
portion of the fingers subdomain, adjacent to 
amino acid residues that interact with the tripho- 
sphate group (Arg659 and Lys663) and away from 
the base stacking residues (Tyr671 and Phe667) 
positioned near the palm subdomain. These amino 
acid substitutions confer low fidelity by stabilizing 
the closed conformation. Substitutions within motif 
A that confer low fidelity include hydrophilic sub- 
stitutions at position 614 (Patel et al. 2001). These 
mutants exhibit a tenfold higher efficiency of mis- 
inserting bases, as well as at least tenfold higher 
efficiency in extending mismatches. Following 
PGR, Taq Pol I mutants containing hydrophilic resi- 
due at position 614 exhibit up to 100-fold higher 
error rates relative to the wild-type enzyme and 
efficiently catalyze both transition and transversion 
errors. In addition to conferring low base-pairing 
fidelity, hydrophilic substitutions for Ile614 also 
allow mutant Tac\ Pol 1 to bypass blocking template 
lesions such as abasic site and vinyl chloride alky- 
lation product ethenoA. Hydrophilic substitutions 
for Ile614 presumably confer a low fidelity by 
"widening" the dNTP-binding pocket to accommo- 
date unusual template base and incoming nucleo- 
tide structures (for further discussion, see Patel 
et al, 2001). 

Incorporation of ribonucleotides 

In cells, the concentration of ribonucleotides is 
orders of magnitude greater than that of the corre- 
sponding deoxynucleotides. As a result, DNA 
polymerases evolved mechanisms to prevent the 
incorporation of ribonucleotides in DNA (Joyce, 
1997). We analyzed this property by testing 291 
"motif A mutant enzymes" for the ability to syn- 
thesize RNA. Twenty-three different mutant poly- 
merases containing substitutions in one of two 
amino acid residues were identified that incorpor- 
ated ribonucleotides at a rate approaching lO^'-fold 
greater than that of wild-type Taq DNA polymer- 
ase (Patel & Loeb, 2000b). Many of the 23 ribonu- 
cleotide incorporating mutants contained multiple 
substitutions but could be divided into two major 
classes: (1) Those encoding a hydrophilic substi- 
tution at Ile614; and (2) those that encode a 



Glu615Asp substitution. Kinetic analysis of some 
of the mutants shows each incorporates ribonu- 
cleotides at an efficiency (kat/Km) 1000 times high- 
er relative to the wild-type enzyme. Analysis of the 
Taq Pol I structure model bound with DNA and 
rNTP shows the ribose ring of the rNTP interacts 
with Ile614 (specifically, with methyl group on the 
P-carbon). In addition, Ile614 amino acid residues 
in an important region of the protein located 
between an tx-helix and a (i-strand. Substitutions of 
Ile614, which is a highly mutable residue, inay 
allow rNTP binding and incorporation by disrupt- 
ing the shape of the hydrophobic pocket that inter- 
acts with the base and ribose portions of the 
incoming dNTP. The aliphatic portion of Glu615 
also forms a portion of this hydrophobic pocket. 
Thus, Glu615Asp substitution may similarly dis- 
rupt the overall conformation of the pocket to 
allow ribonucleotides incorporation; however, this 
disruption is not significant enough to alter base- 
pairing fidelity of Taq Pol I. Taken together, these 
data indicate that 3 out of the 13 motif A amino 
acid residues (Ile614, Glu615, and Arg617), when 
substituted, altered biochemical properties of Taq 
Pol I, while many other substitutions at the other 
amino acid residues appear to be neutral in 
biochemical properties. 



Evolution of the DNA Polymerase 
Active Site 

Our genetic selection protocol allows isolation of 
mutant polymerases that retain a high DNA poly- 
merase activity. Bacteria dependent on these over 
300 mutant active polymerases can be grown 
under logarithmic conditions in liquid broth at 
37 °C (prior to plasmid isolation and protein purifi- 
cation) or as colonies in solid agar at 37 °C (>50 
generations) without significant variations in 
growth kinetics. Thus, bacteria dependent on 
mutant enzymes for survival are fit to replicate 
repetitively. In addition, many enzymes containing 
substitutions within motif A exhibit wild-type-like 
DNA-dependant DNA polymerase biochemical 
activity. The random mutagenesis data suggest 
only 3 out of 26 active site (motif A and B) residues 
analyzed are required for function in vivo, and 
another three or four amino acid residues, when 
substituted, alter the substrate specificity of 
the enzyme. This high level of substitutability of 
evolutionary conserved amino acid residues in the 
polymerase active site indicates multiple sequences 
can encode functioning polymerases. 

This inlierent plasticity of the DNA polymerase 
active site suggests DNA polymerases from geo- 
graphically isolated populations of £. coli should 
contain diverse amino acid substitutions. Specifi- 
cally, we expect after 10* years of evolution 
(Ochman & Wilson, 1988), dividing >100 times 
each year (Gibbons & Kapsimalis, 1967) at a 
mutation rate of 10~^ per nucleodde/division (in 



Review: DNA Polymerase I 



835 



mutators; Mao et ai, 1997; Oliver et ah, 2000; 
Taddei et ai, 1997) to 10"' per nucleotide/division 
(in non-mutators) (Drake, 1991), E. coli polA genes 
might contain silent or neutral mutations at each 
codon position. However, analysis of E. coli strair\s 
isolated from 19 countries in four continents shows 
that each has a nearly identical amino acid and 
nucleotide sequence (>99% identity); nucleotides 
encoding motif A amino acid residues are 100% 
identical in all of these distinct E. coli straias. Con- 
sistent with the E. coli sequence data, two geo- 
graphically distinct strains of Rickettsia prowazekii 
(strains B and Madrid E) have identical amino acid 
and nucleotide sequences for the entire polA gene 
(Andersson & Andersson, 1999). These data show 
that there is a high sequence identity in the polA 
gene of diverse E. coli. Comparison of polA 
sequences from individual species within the Rick- 
ettsia genus showed the greatest degree of nucleo- 
tide sequence identity (as judged by the presence 
of synonymous codons) occurs within the DNA 
polymerase and 5' nuclease catalytic sites. For 
example, 24 amino acid residues within and flank- 
ing motif A are encoded by the identical codons in 
at least seven Rickettsia species. The frequency for 
this level of conservation for any one-codon pos- 
ition within the entire polA gene of Rickettsia is 0.8. 
Thus, the probability of 24 contiguous amino acid 
residues being encoded by the same codon by ran- 
dom chance is <0.005 (=0.8^"). 

In summary, we find strains of identical species, 
isolated from geographically diverse areas have 
nearly the same nucleotide sequence within the 
poIA gene, and species within the same genus have 
nearly identical active site nucleotide sequence, 
suggesting the polA gene evolves at a very slow 
rate. Thus, we are presented with two surprising 
and seemingly contradictory findings: DNA Pol I 
is highly plastic and allows multiple amino acid 
substitutions within its catalytic site, yet the polA 
gene is highly stable in nucleotide sequence within 
species. It is always conceivable that the wild-type 
amino acid sequence has a selective advantage 
over prolonged evolution; however, it seems 
improbable that selection would also prevail at the 
nucleotide sequence level. As an alternative, we 
propose that genetic transfer mechanisms maintain 
homogeneous sequences. Specifically, we propose 
the inherent plasticity of DNA polymerase enables 
tolerance of the high mutation burden during 
adverse conditior« diaracterized by selection of 
mutators (Mao et al, 1997) and facilitates the gen- 
eration of beneficial mutations with a short-term 
selective advantage. Following successful survival 
through periods of adverse conditions, the wild- 
type sequence (one that is fit and the most preva- 
lent) can be generated through genetic transfer, a 
process involving non-reciprocal recombination by 
substitution witfi wild-type sequences. Selection 
for a fit sequence could potentially be the driving 
force for genetic transfer. This model suggests that 
genetic transfer can function to maintain sequence 
homogeneity of DNA poljTnerase I and presum- 



ably other enzymes and could account for the 
dichotomy between inherent amino acid substitut- 
ability within diverse enzymes and the constancy 
of the nucleotide sequence found in nature. At the 
very least, the plasticity of the active site of DNA 
polymerase I, in contrast to the exceptional convSer- 
vation of this region during evolution, suggests 
that there are mechanisms for maintaining nucleo- 
tide sequences during evolution that need to be 
explored. 



Conclusions 

Diverse prokaryotic species contain DNA poly- 
merase I that are structurally identical and similar 
in sequence and in function. Furthermore, structur- 
al data indicate that very few (<10) amino acid 
residues within the highly conserved motifs A, B, 
and C have a direct role during nucleotide binding 
and incorporation. Random mutagenesis data indi- 
cate that only those residues that are important 
during catalysis and /or for protein folding need to 
be maintained; all other residues are mutable. 
While some substitutions alter the substrate speci- 
ficity of the polymerase, many do not. Bacteria 
dependent on these mutated polymerases for sur- 
vival are fit to replicate repetitively without signifi- 
cant variations in growtli kinetics; however, the 
long-term fitness of bacteria harboring these 
mutant enzymes is unknown. Sequence analysis of 
the gene encoding Pol I shows strains of identical 
species isolated from geographically diverse areas 
have nearly the same nucleotide sequence within 
the polA gene, and species within the same genus 
have nearly identical active site nucleotide 
sequence, suggesting that the polA gene evolves at 
a very slow rate. These combined data provide the 
underpinning for the use of E. coli DNA polymer- 
ase I or Taq DNA polymerase I as a prototype for 
studying mechanisms and functions of DNA poly- 
merases in diverse organisms. 
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